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the Euler discretization of the overdamped Langevin stochastic differential 
equation associated with tt. For both constant and decreasing step sizes in 
the Euler discretization, we obtain non-asymptotic bounds for the conver¬ 
gence to the target distribution tt in total variation distance. A particular 
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1. Introduction 

Sampling distributions over high-dimensional state-spaces is a problem which 
has recently attracted a lot of research efforts in computational statistics and 
machine learning (see [11] and [1] for details); applications include Bayesian 
non-parametrics, Bayesian inverse problems and aggregation of estimators. All 
these problems boil down to sample a target distribution tt having a den¬ 
sity w.r.t. the Lebesgue measure on known up to a normalisation factor 
X !->■ / /jjd e“*^(^)dy where U is continuously differentiable. We consider a 

sampling method based on the Euler discretization of the overdamped Langevin 
stochastic differential equation (SDE) 

dYt = -VUiYt)dt + V2dBf , (1) 

where is a d-dimensional Brownian motion. It is well-known that the 

Markov semi-group associated with the Langevin diffusion {Yt)t>o is reversible 
w.r.t. TT. Under suitable conditions, the convergence to tt takes place at geomet¬ 
ric rate. Precise quantitative estimates of the rate of convergence with explicit 
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dependency on the dimension d of the state space have been recently obtained 
using either functional inequalities such as Poincare and log-Sobolev inequali¬ 
ties (see [3, 9] [4]) or by coupling techniques (see [15]). The Euler-Maruyama 
discretization scheme associated to the Langevin diffusion yields the discrete 
time-Markov chain given by 

Xk+i = Xk — ^k->ri^U{Xk) + \/2ji^Zk+i ( 2 ) 

where {Zk)k>i is an i.i.d. sequence of standard Gaussian d-dimensional random 
vectors and ( 7 fc)fe>i is a sequence of step sizes, which can either be held constant 
or be chosen to decrease to 0. The idea of using the Markov chain {Xk)k>o to 
sample approximately from the target tt has been first introduced in the physics 
literature by [34] and popularised in the computational statistics community 
by [17] and [18]. It has been studied in depth by [35], which proposed to use 
a Metropolis-Hastings step at each iteration to enforce reversibility w.r.t. tt 
leading to the Metropolis Adjusted Langevin Algorithm (MALA). They coin 
the term unadjusted Langevin algorithm (ULA) when the Metropolis-Hastings 
step is skipped. 

The purpose of this paper is to study the convergence of the ULA algorithm. 
The emphasis is put on non-asymptotic computable bounds; we pay a particular 
attention to the way these bounds scale with the dimension d and constants 
characterizing the smoothness and curvature of the potential U. Our study 
covers both constant and decreasing step sizes and we analyse both the ’’finite 
horizon” (where the total number of simulations is specified before running the 
algorithm) and ”any-time” settings (where the algorithm can be stopped after 
any iteration). 

When the step size 7 ^ = 7 is constant, under appropriate conditions (see 
[35]), the Markov chain (A„)„>o is U-uniformly geometrically ergodic with a 
stationary distribution ir-y. With few exceptions, the stationary distribution is 
different from the target tt. If the step size 7 is small enough, then the stationary 
distribution of this chain is in some sense close to tt. We provide non-asymptotic 
bounds of the U-total variation distance between ir-y and tt, with explicit depen¬ 
dence on the step size 7 and the dimension d. Our results complete and extend 
the recent works by [13] and [12]. 

When ( 7 fe)fc>i decreases to zero, then {Xk)k>o is a non-homogeneous Markov 
chain. If in addition X[fe=i show that the marginal distribution of 

this non-homogeneous chain converges, under some mild additional conditions, 
to the target distribution tt, and provide explicit bounds for the convergence. 
Compared to the related works by [23], [24], [25] and [26], we establish not 
only the weak convergence of the weighted empirical measure of the path to the 
target distribution but a much stronger convergence in total variation, similarly 
to [ 12 ], where the strongly log-concave case is considered. 

The paper is organized as follows. In Section 2, the main convergence results 
are stated under abstract assumptions. We then specialize in Section 3 these 
results to different classes of densities. The proofs are gathered in Section 4. 
Some general convergence results for diffusions based on reflection coupling, 
which are of independent interest, are stated in Section 5. 
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Notations and conventions 


denotes the Borel cr-field of and F(R'^) the set of all Borel measur¬ 
able functions on For / G F(R‘^) set ||/||oo = sup^ggd |/(a:)|. Denote by 
M(K‘^) the space of finite signed measure on (K‘^, S(R.'^)) and Mo(R.‘^) = {ft G 
M(K‘^) I fL(W‘) = 0}. For ft G M(]R'^) and / G F(]R'^) a /i-integrable function, de¬ 
note by the integral of / w.r.t. ft. Let V : R.'^ —>■ [1, oo) be a measurable func¬ 
tion. For / G F(R‘^), the F-norm of / is given by ||/||y = sup 3 ,gRd \f{x)\/V{x). 
For ft G M(R‘^), the F-total variation distance of ft is defined as 



If F = 1, then || • ||y is the total variation denoted by || • ||tv- 

Forp > 1, denote by L?’(7r) the set of measurable functions such that 7r(|/|P) < 
00 . For / G L^(7r), the variance of / under tt is denoted by Var^ {/}. For all 
functions / such that /log(/) G L^(7r), the entropy of / with respect to tt is 
defined by 



Let ft and v be two probability measures on R'^. If /r <C we denote by dft/dv 
the Radon-Nikodym derivative of ft w.r.t. z/. Denote for all x,y gM.‘^ by {x,y) 
the scalar product of x and y and ||x|| the Euclidean norm of x. For k > 0, denote 
by C'*^(R‘^), the set of fc-times continuously differentiable functions / : R"^ —>■ R. 
For / G C'^(R'^), denote by V/ the gradient of / and A/ the Laplacian of /. 
For all a: G R"^ and M > 0, we denote by B{x,M), the ball centered at x of 
radius M. Denote for iG > 0, the oscillation of a function / G C'°(R'^) in the ball 
B{0,K) by osckH) = sup 3 (o_^)(/) — infB(o,ic)(/)- Denote the oscillation of a 
bounded function / G C'°(R'^) on R'^ by oscRd(/) = supgd(/) — infRd(/). In the 
sequel, we take the convention that J2p = 0 and = 1, for n,p G N, n < p. 

2. General conditions for the convergence of ULA 

In this section, we derive a bound on the convergence of the ULA to the target 
distribution tt when the Langevin diffusion is geometrically ergodic and the 
Markov kernel associated with the EM discretization satisfies a Foster-Lyapunov 
drift inequality. 

Consider the following assumption on the potential U: 

LI. The function U is continuously differentiable on R'^ and gradient Lipschitz, 
i.e. there exists L >0 such that for all x,y G R'^, 


\\VU{x)-VU{y)\\<L\\x-y\\ . 


Under LI, by [20, Theorem 2.4-3.1] for every initial point x G R'^, there exists 
a unique strong solution (Yt{x))t>o to the Langevin SDE (1). Define for all 
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t >0, X and A G Pt{x,A) = P(l^(a;) G A). The semigroup {Pt)t>o 

is reversible w.r.t. tt, and hence admits tt as its (unique) invariant distribution. 
In this section, we consider the case where {Pt)t>o is geometrically ergodic, 
i.e. there exists k G [0,1) such that for any initial distribution /tq and t > 0, 

\\^J-oPt-n\\^y <C{^J,o)K* , (3) 

for some constant C(^o) G [0,+oo]. Denote by the generator associated 
with the semigroup {Pt)t>o, given for all / G by 

^^f = -{VU,Vf) + Af. 

A twice continuously differentiable function V : —>■ [l,oo) is a Lyapunov 

function for the generator if there exist 9 > 0, fi > 0 and £ C B such that, 

<-eV + pie . (4) 

By [35, Theorem 2.2], if £ in (4) is a non-empty compact set, then the Langevin 
diffusion is geometrically ergodic. 

Consider now the EM discretization of the diffusion (2). Let {'jk)k>i be a 
sequence of positive and nonincreasing step sizes and for 0 < n < p, denote by 

p 

Ln.p = ^ ^ qfc , Lti = Ti^Ti . (5) 

k—n 

For 7 > 0, consider the Markov kernel Rj given for all A G B{M.'^) and a; G 
by 

Rj{x,A) = ^(4717)“"*/^ exp (^-( 47 )“^ ||i/- a;-b 7 VC/(a;)||^) dy . 

The discretized Langevin diffusion (A„)„>o given in (2) is a time-inhomogeneous 
Markov chain, for p > n > 1 and / G F+(IR‘^), [f{Xp)] = Ql^’Pf(Xn) where 

Pn = cr{Xi, 0 < i < n) and 

with the convention that for n,p > 0, n < p, is the identity operator. Under 
Ll, the Markov kernel Rj is strongly Feller, irreducible and strongly aperiodic. 
We will say that a function U : —>■ [1, 00 ) satisfies a Foster-Lyapunov drift 

condition for R^ if there exist constants 7 > 0, A G [0,1) and c > 0 such that, 
for all 7 G (0, 7 ] 

R^V < A'>'U -b 7 c . ( 6 ) 

The particular form of ( 6 ) reflects how the mixing rate of the Markov chain 
depends upon the step size 7 > 0. If 7 = 0, then Ro{x, A) = 6x{A) for a; G R^^ and 
A G ^(R^^). A Markov chain with transition kernel Rq is not mixing. Intuitively, 
as 7 gets larger, then it is expected that the mixing of Rj increases. If for some 
7 > 0 , R-y satisfies ( 6 ), then R^ admits a unique stationary distribution 7 r.y. 
We use ( 6 ) to control quantitatively the moments of the time-inhomogeneous 
chain. The types of bounds which are needed, are summarised in the following 
elementary Lemma. 
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Lemma 1. Let 7 > 0. Assume that for all x and 7 G (0,7], (6) holds for 
some constants A G (0,1) and c > 0. Let {'jk)k>i be a sequence of nonincreasing 
step sizes such that 7^ G (0,7] for all k G N* . Then for all n > 0 and x G 
QjV{x) < F{X,Tn,c,"fi,V{x)) where 

F{X, a, c, 7, w) = X°‘w + c(-A'>' log(A))“^ . (7) 

Proof. The proof is postponed to Section 4.1. □ 

Note that Lemma 1 implies that supf.>Q{(3!^y(a;)} < G{X,c,"fi,V{x)) where 

G(A,c, 7 ,w;) = w + c(-A'^log(A))“^ . (8) 

We give below the main ingredients which are needed to obtain a quantitative 
bound for \\5xQ^ — ttHtv for all x G This quantity is decomposed as follows: 
for all 0 < n < p, 


\\5xQflf — ttIItv 

< - ttIItv ■ (9) 

To control the first term on the right hand side, we use a method introduced in 
[13] and elaborated in [12]. The second term is bounded using the convergence 
of the semi-group to tt, see (3). 

Proposition 2. Assume that LI and (3) hold. Let {'yk)k>o be a sequence of 
nonnegative step sizes. Then for all x G n > 0, p > 1, n < p, 


ll<5xg?]-^llTv 


/p-i \ 1/2 

^ 2 ^ {(7^+i/3)A(7,x)- bd7fc-Hi} ) + , (10) 

\k—n / 


where k,C{5xQV:) are defined in (3) and 


A{j,x)=svip[ llV17(z)l|^Q*(?/,dz) . (11) 

k>0 jRrf 


Proof. The proof follows the same lines as [12, Lemma 2] but is given for com¬ 
pleteness. For 0 < s < t, let C([s, t], R'^) be the space of continuous func¬ 
tions on [s,t] taking values in R'^. For all y G R'^, denote by plf p and fi-f p 
the laws on C([FTi,rp] ,R‘^) of the Langevin diffusion (Ft(p))r„<t<rp and of the 
continuously-interpolated Euler discretization (bt(?/))r„<t<rp, both started at 
y at time r„. Denote by (Ft(p),Tt(y))t>r„ the unique strong solution started 
at (p, y) at time t = r„ of the time-inhomogeneous diffusion defined for t > F^, 

by 

r dF, = - W(F0dt + V2ABt 

\ dYt = -VU{Y, t)dt + V2dBf , ^ 
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where for any continuous function w : R+ —>■ and t > r„ 

OO 

W(w,t) = ^ VC/(wrJl[r..r.+,)W ■ (13) 

k—n 

Girsanov’s Theorem [21, Theorem 5.1, Corollary 5.16, Chapter 3] shows that 
p and are mutually absolutely continuous and in addition, p-almost 
surely 






n.p 

= exp 


1 


NuiZiy)) - vc/(r(y), s), di;(j/)) 

{\\VU{Ys{y))\\" -\\Wi{Y{y),s)\\"}ds 


Under LI, (14) implies for all y £ 


pd. 


KLiyyjm^j,) < 4 


E 


ds 


P-l pTk+i 

p-1 


ds 


(14) 


VU{My))-VU{Y{y),s) 

VU{Z{y))-VU{YrM) 

<4-iL2^d(^3^^/3) f \\yu{z)fQ';+^-'^{y,dz)+dj, 

k=n ( 


(15) 


By the Pinsker inequality, \\SyQ!^+^'P - ^yPr„+i,„ ||tv < ^{KL(^(( p)}^/^. 

The proof is concluded by combining this inequality, (15) and (3) in (9). □ 

In the sequel, depending on the conditions on the potential U and the tech¬ 
niques of proof, for any given x £ K^, C{6xQVj) can have two kinds of upper 
bounds, either of the form — log(7„)lU(a;), or exp(ar„)lU(x), for some function 
IT : —>■ R and a > 0. In both cases, as shown in Proposition 3, it is possi¬ 
ble to choose n as a function of p, so that limp^+oo W^xQ^ — ttHtv = 0 under 
appropriate conditions on the sequence of step sizes {yk)k>i- 

Proposition 3. Assume that LI and (3) hold. Let {'jk)k>i be a nonincreasing 
sequence satisfying linik^^ooLk = +oo ond limfe_>.oo 7^ = 0. Then, lim„_>oo \\SxQj 
’’’IItv = 0 for any x £ R'^ for which one of the two following conditions holds: 

(i) A{'j,x) < OO andlimsup„_,._,_(^C'(i5a;(5")/(-log(7„)) < -l-oo, where A{'^,x) 
is defined in (11). 

fti) Efci7fc < +00, ^(7,0;) < OO and hmsup„_,+(^ log{C'(4Q")}/rn < 
+ 00 . 
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Proof. (i) There exists such that for all p > po, > 7p and . 

Therefore, we can define for all p > po, 

n(p) = min {/c e {0, • • • - 1} > 7fe+i} . (16) 

and n{p) > 1. We first show that liminfp_,.oo n{p) = oo. The proof goes by con¬ 
tradiction. If liminfp^oo n(p) < oo we could extract a bounded subsequence 
{n{pk))k>i- For such sequence, {jn{pk)+i)k>i is bounded away from 0, but 
limfe^+oo = 0 which yields to a contradiction. The definition of 

Ti{p) implies that k^"(p)’P < 7„(p), showing that 


lim sup 

p—>-+oo 

< limsup — I limsup {7n(p)(- log(7n(p)))} = 0 . 

p->-|-oo log(7„(p)) p_>.+oo 

On the other hand, since {'yk)k>i is nonincreasing, for any i > 2, 


'ffc — 4(p)+i^»i(p)-i-i,p — 7„(p)+i log(7n(p)-i-i)/log(^) • 

k—n{p)-\-l 


The proof follows from (10) using limp_^oo 7rt(p) = 0- 

(ii) For allp > 1, define n{p) = max(0, [log(rp)J). Note that since limfe_>+oo F^ 
+00, we have limp_>+oo f^(p) = +oo. Using J2k=i^k < (7fc)/c>i is a 

nonincreasing sequence, we get for all £ > 2, 


p 


lim 


p—>-+oo 


E = 

k—n{p) 


which shows that the first term in the right side of (10) goes to 0 as p goes to 
infinity. As for the second term, since limsup^^+j^ log{C'(4Q")}/r„ < -l-oo, 
we get using that {jk)k>i is nonincreasing and n{p) < log(rp), 


C{6^Q:;^p^)k^-m.p 


(log(K)rp + [{iog(C(4Q!;<^^))/r„(p)}+ - log(K)] r„(p)) 


< exp I 


< exp (log(K)rp -I- 


sup{iog(C'(4Q!;))/rfe}+ - log(K) 

k>l 


Fn.lp) 

71 log(rp) 


Using K < 1 and limfe_>,_|_oo Fj, = -foo, we have limp_,,_|_oo = 0, 

which concludes the proof. 

□ 

Using (10), we can also assess the convergence of the algorithm for constant 
step sizes 7fc = 7 for all fc > 1. Two different kinds of results can be derived. 
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First, for a given precision e > 0, we can try to optimize the step size 7 to 
minimize the number of iterations p required to achieve \\SxQj — ti'IItv ^ £• 
Second if the total number of iterations is fixed p > 1, we may determine the 
step size 7 > 0 which minimizes \\5xQ^ — ttUtv- 

Lemma 4. Assume that (10) holds. Assume that there exists 7 > 0 such that 
C{x) = sup^g(Q ,^] sup„>i C{5xRV^) < +00 and sup^g(g A(^,x) < A{x), where 
C(5xRV^) andA(j,x) are defined in (3) and (11) respectively. Then for alls > 0, 
we get \\SxRp - ttHtv < £ */ 


p > T7 ^ 


and 7 < 


-d+^/d'^ + (2/3)A(x)e2(L2T) 
2A{x)/3 


-1 


A 7 


(17) 


where 

T = (log{C'(a;)} - log(£/ 2 )) j (- log(K)) . 

Proof. For p > T^~^, set n = p — [F7~^J • Then using the stated expressions of 
7 and T in ( 10 ) concludes the proof. □ 

Note that an upper bound for 7 defined in ( 17 ) is e^{LfTd)~^. The depen¬ 
dency of T on the dimension d will be addressed in Section 3 . 

Lemma 5. Assume that LI and (3) hold. In addition, assume that there exist 
7 > 0 and n € N, n > 0, such that Cn(x) = sup..yg(o,7] C'(SxRj) < +00 and 
sup.^g(Q Aiff, x) < A{x). For all p > n and all x G , if j = log(p — 'ii){{p — 
n)(—log(K))}“^ < 7, then 


WSxRP-ttWtv 

< {p-n)~^/^{Cn{x){p-n)~^^'^-\-log{p-n){d-\-A{x)log{p-n){p-n)~^)^/'^} . 

Proof. The proof is a straightforward calculation using (10). □ 

To get quantitative bounds for the total variation distance \\SxQf — ttHtv it 
is therefore required to get bounds on k, A{^,x) and to control C(SxQj). We 
will consider in the sequel two different approaches to get (3), one based on 
functional inequalities, the other on coupling techniques. We will consider also 
increasingly stringent assumptions for the potential U. Whereas we will always 
obtain the same type of exponential bounds, the dependency of the constants 
on the dimension will be markedly different. In the worst case, the dependency 
is exponential. It is polynomial when U is convex. 


3. Practical conditions for geometric ergodicity of the Langevin 
diffusion and their consequences for ULA 

3.1. Superexponential densities 

Assume first that the potential is superexponential outside a ball. This is a 
rather weak assumption (we do not assume convexity here). 
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HI. The potential U is twice continuously differentiable and there exist p > Q, 
a G ( 1 , 2 ] and Mp > 0 such that for all x G \\x — x*|| > Mp, {yU{x), x — x*) > 
p\\x-x*\\°‘. 

The price to pay will be constants which are exponential in the dimension. 
Under HI, the potential U is unbounded off compact set. Since U is continuous, 
it has a global minimizer x*, which is a point at which VU{x*) = 0. Without 
loss of generality, it is assumed that U{x*) = 0. 

Lemma 6 . Assume LI and HI. Then for all x G 
U{x) > p\\x — x*\\°'/{a-\-1) — Oa with Oa = pMp /{a + 1) + MpL/2 . (18) 
Proof. The elementary proof is postponed to Section 4.2. □ 

Following [35, Theorem 2.3], we first establish a drift condition for the diffu¬ 
sion. 

Proposition 7. Assume L 1 and H 1. For any ^ G (0,1), the drift condi¬ 
tion (4) is satisfied with the Lyapunov function V^{x) = exp(cC/(x)), 6^ = 

i^dL, = P{x*,K^), = max{{2dL/{p{l — and j3^ = 

supj-j^g£^}{V((?/)}. Moreover, there exist constants < oo and > 0 such 
that for all t G M+ and probability measures po and i/q on satisfying 

Aio(K) + ^^o(K) < +00; 

ll^oPt — r'oPtllK ^ — ^'oUk , H^oPt — ’’■IIk < • 

Proof. The proof, adapted from [35, Theorem 2.3] and [31, Theorem 6.1], is 

postponed to Section 4.3. □ 

Under HI, explicit expressions for and have been developed in the 
literature but these estimates are in general very conservative. We now turn to 
establish ( 6 ) for the Euler discretization. 

Proposition 8 . Assume Ll and HI. Let 7 G (0,L“^). For all 7 G ( 0 , 7 ] 
and X G R.'^, R-y satisfies the drift condition ( 6 ) with V{x) = exp{U{x)/2), 
K = max(Mp,(81og(A)/p2)i/(2(a-i)))^ ^ = -2 log(A)A-^ sup{ygB(x*.if)} 
and A = _ 


Proof. The proof is postponed to Section 4.4. □ 

Theorem 9. Assume Ll and HI. Let {fk)k>i be a nonincreasing sequence with 
7 i < 7 ) 7 € ( 0 , L~^). Then, for all n > 0, p > 1, n < p, and x G R^^, (10) holds 
with log(K) = —Ui /2 and 


A{-f,x) < L^ 





4(2 — a){a + 1) 
ap 


+ 21og{G(A,c,7i,U(a;))} 


2/a 


G(4Q!;) < Gi/ 2 F(A, ri,„, c, 71, ^(o^)) , (19) 

where C 1 / 2 , Vi /2 are given by Proposition 7, F by (7), V, A, c in Proposition 8, 
G by (8), Oa in (18). 
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Proof. The proof is postponed to Section 4.5. □ 

Equation (19) implies that for all x G we have sup„>g C(5a;(5I)) < 
G'(A, c, 7 i, E(a:)), so Proposition 3-(i) shows that limp_>+oo —7 i||tv = 0 for 

all a; G provided that limfe_>+oo 7 fc = 0 and limfc_>+oo P/c = +oo. In addition, 
for the case of constant step size 7 fe = 7 for all /c > 1, Lemma 4 and Lemma 5 
can be applied. 

Let V : ^ R, defined for all a; G R'^ by V{x) = exp(t/(a;)/2). By Propo¬ 

sition 7, {Pt)t>o is a contraction operator on the space of finite signed measure 
ft G Mg, < -|-oo, endowed with the norm || • ||yi/2. It is therefore possible 

to control WSxQf^ — 7 r||yi/ 2 . To simplify the notations, we limit our discussion to 
constant step sizes. 

Theorem 10. Assume LI and HI. Then, for all p > 1, x G R'^ and 7 G 
( 0 ,L- we have 

\\5xRl, - ^llvi /2 < C^i^rP-Py^l^ix) + Bin, Vix)) , (20) 

where log(/c) = —Ui/ 4 , ^ 1 / 4 , Ui/ 4 , 6 * 1 / 2 >/ 3 i /2 tti'e defined in Proposition 7, V, A, c 
in Proposition 8, G in ( 8 ) and 

B^ij,v) = L^max(l,C'i^/ 4 )(l -h7)(l - k)~^ (2G'(A, c, 7 , u) -h/3 i/2/6'i/2) 

X (7d-h3”^72||VC/|l^i/2 G(A,c,7,i;)) . 
Moreover, has a unique invariant distribution n-y and 

Ik-7^711^1/2 < B(7, 1) • 

Proof. The proof of (20) is postponed to Section 4.6. The bound for || 7 r— 7 rg,||yi /2 
is an easy consequence of (20): by Proposition 13 and [30, Theorem 16.0.1], R-y 
is l/^G_iiiiiforinly ergodic: hmp_i.+oo WSxR^ — 'x-yWv^/'^ = 0 for aU x G R^^. Finally, 
(20) shows that for all x G R"^, 

Ik - 7 ^ 4 , 11 ^ 1/2 < lim {\\5xRfy - + \\5xR^-t - < B{f,V{x)) . 

Taking the minimum over x G R"^ concludes the proof. □ 

Note that Theorem 10 implies that there exists a constant G > 0 which does 
not depend on 7 such that || 7 r — 7 r 4 ||yi /2 < G 7 ^/^. 

Remark 11. It is shown in [37, Theorem f] that for (f> G G°°(R‘^) with polyno¬ 
mial growth, (</)) — 7 r((/)) = 6 ( 0 ) 7 -|-G( 7 ^), for some constant b{(j)) G R, provided 
that U G G°°(R‘^) satisfies LI and HI. Our result does not match this bound 
since 5 ( 7 ,!) = 0 ( 7 ^/^). However the bound 5 ( 7 ,1) is uniform over the class 
of measurable functions 4> satisfying for all x G R'^, k(a:)| < V'^/'^ix). Obtain¬ 
ing such uniform bounds in total variation is important in Bayesian inference, 
for example to compute high posterior density credible regions. Our result also 
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strengthens and completes [29, Corollary 7.5], which states that under HI with 
a = 2, for any measurable funetions 4> : -A K satisfying for all x,y G 

\c[iw)-ct>{y)\<C\\x-y\\{l+\\xt + \\yt}, 

for some C > 0, k > 1, — 7i‘(</i’)| < C'7^ for some eonstants C > 0 and 

X G (0,1/2), which does not depend on 4>. 

The bounds in Theorem 9 and Theorem 10 depend upon the constants ap¬ 
pearing in Proposition 7 which are computable but are known to be pessimistic 
in general; see [36] . More explicit rates of convergence for the semigroup can be 
obtained using Poincare inequality; see [3], [9] and [4, Chapter 4] and the refer¬ 
ences therein. The probability measure tt is said to satisfy a Poincare inequality 
with the constant Cp if for every locally Lipschitz function h, 

YaT„{h}<Cp[ ||V/i(a;)f 7r(dx) . (21) 


This inequality implies by [9, Theorem 2.1] that for all t > 0 and any initial 
distribution /ig, such that fiQ ^ tt, 


WtioPt - ttUtv < exp(-t/Cp) (Var,, {d/ig/dTr})^/^ 


( 22 ) 


[2, Theorem 1.4] shows that if the Lyapunov condition (4) is satisfied, then the 
Poincare inequality (21) holds with an explicit constant. Denote by 


-d/2 


Dnil) = 471 < J|(l - Ljk) \ 


(23) 




2=1 


Theorem 12. Assume Ll and HI. Let {'yk)k>i be a non increasing sequence. 
Then for all n> 1 and x € Equation (3) holds with 


log(K) = (-6 'i /2 (l + 


- — Tmrm — 




where P is the Gamma funetion and the eonstants are given 

in Proposition 7 and (18) respeetively. 

Proof. The proof is postponed to Section 4.7. □ 

Note that for all x G R‘^,C{SxQj) satisfies the conditions of Proposition 3-(ii). 
Therefore using in addition the bound on Af-/, x) for all x G'R.‘^ and 7 G (O, 
given in Theorem 9, we get limfe^+00 \\5xQ^ - ttUtv = 0 if lim„^+oo r„ = -|-oo 
and lim„^+oo Yl=i ll < 
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3.2. Log-concave densities 


We now consider the following additional assumption. 

H2. U is convex and admits a minimizer x* for U. Moreover there exist rj > 0 
and > 0 such that for all x € \\x — a;*|| > Mrj, 

U{x)-U{x*)>f]\\x-x*\\ . (24) 

It is shown in [2, Lemma 2.2] that if U satisfies LI and is convex, then (24) 
holds for some constants t], M^i which depend in an intricate way on U. Since 
the constants 77, appear explicitly in the bounds we derive, we must assume 
that these constants are explicitly computable. We still assume in this section 
that U{x*) = 0. Define the function Wc : t [1, +00) for all a: S by 

Wcix) = exp((?7/4)(||a: - + 1)^/^) . (25) 


We now derive a drift inequality for R-/ under H2. 

Proposition 13. Assume LI and H2. Let 7 S (O, L ^]. Then for all 7 S (0,7], 


Wc satisfies ( 6 ) with X = e ^ 1)^ = max(l, 2^/77, M^), 

c = {( 7 ?/ 4 )(d + (777/4)) - log(A)} e''(«c+l)^'V4+(r,7/4)(d+(r,7/4)) ^ (26) 

Proof. The proof is postponed to Section 4.8 □ 

Corollary 14 . Assume LI and H 2 . Let {'Yk)k>i be a nonincreasing sequence 
with 7i < 7, 7 G (O, • Then, for all n > 0, p > 1, n < p, and x £ 

A(7,a:) = (477"^ [ 1 + log{G(A,c,7i, Wc(a:))}])^ , ( 27 ) 

where A(pi,x) is defined by ( 11 ) and G, Wc, X, c, are given in (8), ( 25 ), Propo¬ 
sition 13 respectively. 

Proof. The proof is postponed to Section 4 . 9 . □ 


If U is convex, [5, Theorem 1.2] shows that tt satisfies a Poincare inequality 
with a constant depending only on the variance of tt. 

Theorem 15. Assume LI and H2. Let {fk)k>i be a nonincreasing sequence 
with 7i < 7, 7 G (0, • Then, for all n > 0, p > 1, n < p, and x £ R.'^, (10) 

holds with A(j,x) given in (27), 


log(K) 


G(<5,g!;) 


-432 




X- yn{dy) 



-1 


/ (27r)(^+i)/2(d- 1)! 

[~^fr{(dTTm~ 


T^d/2Md \ 

r(d/2 + i)j 


D„( 7 )exp( 17 (x)) , 


(28a) 

(28b) 


where Dniff) is given in (23). 
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Proof. The proof is postponed to Section 4.10. □ 

For all X G C{6xQ'f) satisfies the conditions of Proposition 3-(ii). There¬ 
fore, if lim„^+oo r„ = -1-00 and hm„^+oo Z]fe=i 7fe < we get linifc_>+oo \\5xQ%- 
ttIItv = 0. 

There are two difficulties when applying Theorem 15. First the Poincare 
constant (28a) is in closed form but is not computable, although it can be 
bounded by a 0{d~^) . Second, the bound of Var7r{dJa;Q”/d7r} is likely to 
be suboptimal. To circumvent these two issues, we now give new quantitative 
results on the convergence of {Pt)t>o to tt in total variation. Instead of using 
functional inequality, we use in the proof the coupling by reflection, introduced 
in [27]. Define the function oj : (0,1) x —>■ R+ for all e S (0,1) and i? > 0, by 

ut{e,R) = R^/{2^-\l-e/2)}\ (29) 

where ^ is the cumulative distribution function of the standard Gaussian distri¬ 
bution and is the associated quantile function. Before stating the theorem, 
we first show that (4) holds and provide explicit expressions for the constants 
which come into play. These constants will be used to obtain the explicit con¬ 
vergence rate of the semigroup {Pt)t>o to tt which is derived in Theorem 17. 

Proposition 16. Assume LI and H2. Then Wc satisfies the drift condition 
(4) with 0 = 77^/8, £ = B{x*,K), K = max(l, M^, 4^/77) and 

fj = (77/4) {{r]/4:)K A- d) maxjl, (X^ -|- 1)“^/^ exp(77(iF^ -|- l)^/^/4)| . 

Proof. The proof is adapted from [2, Corollary 1.6] and is postponed to Sec¬ 
tion 4.11. □ 

Theorem 17. Assume LI and H 2. Then for all x € — Trjjrpy < 

2A(x)e~^^^^ + 4ru*, where 


log(ro) = -log(2)(6l/4) 


log{0-i/3(3 + 4e^~'®“(2 ^(8^)l°g(4e V)))|+log(2) 

A(a:) = (l/2)(lTc(x) +0-^) +20-l/3e4“'®‘"(2-^(8/’7)log(4e-l/3)) ^ 


(30a) 


(30b) 


the function Wc is defined in (25), the constants 9,j3 in Proposition 16. 

Proof. The proof is postponed to Section 5.1. □ 

Note that the bound, we obtain is a little different from (3). The initial 
condition is isolated on purpose to get a better bound. A consequence of this 
result is the following bound on the convergence of the sequence {SxQ'f)n>o to 

TT. 
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Corollary 18. Assume LI and H2. Let {'fk)k>o be a sequence of nonnegative 
step sizes. Then for all x S n>{),p>\,n<p, 

/p-i 

- ttIItv < 2-1/2L ( ^ {(7|+i/3)A(7,a;) +d7^+i} 

\k—n 

where A{f,x), w are given by (27) and (30a) respectively and 

A(4Q!;) = (1/2)(F(A, r„, 71, c, W'c(x)) + 0" V) 

+ , (31) 

the functions F and Wc are defined in (7) and (25), the constants X^c,6,j3 in 
Proposition 13 and Proposition 16 respectively. 

Proof. By Theorem 17, Proposition 13 and Lemma 1, we have for all x G 
||4Q:;Pr„+i„ - ttIItv < 2A(4Q!;)e-®^"+^-/4 + . 

Finally the proof follows the same line as the one of Proposition 2. □ 

Contrary to (28b), (31) is uniformly bounded in n. By Corollary 18 and 
(27), we can apply Proposition 3-(i), which implies the convergence to 0 of 
\\SxQ^ - Till TV as p goes to infinity, if limfe^.+oo 7fc = 0 and limfc_>+oo Pfe = +oo. 
Since log(/3) in Proposition 16 is of order d, we get that the rate of convergence 
log(ti7) is of order d~‘^ as d goes to infinity (note indeed that the leading term 
when d is large is 6u! (2“^, (8/7) log(40“^/3)) which is of order d^). In the case 
of constant step sizes 7fc = 7 for all A: > 0, we adapt Lemma 4 to the bound 
given by Corollary 18. 

Corollary 19. Assume LI and H2. Let (pfk)k>o be a sequence of nonnegative 
step sizes. Then for all e > 0, we get \\SxR?f — ttHtv < £ */p 7 satisfy (17) 

with 

T = max |40“^ log ^8e“^A(x)^ , log(16e“^) j (— log(ti7))| 

A(x) = (l/2)(G(A,7i,c,lFc(a;)) +6'”^/3) + ^ 

where A(7,x), vj are given by (27), (30a) respectively, the functions G and 
Wc are defined in (8) and (25), the constants \,c,6,j3 in Proposition 13 and 
Proposition 16 respectively. 

Proof. The proof follows the same line as the one of Lemma 4 using Corollary 18 
and that sup„>o A(d(3") < A(x) for all x G □ 

In particular, with the notation of Corollary 19, since max(log(/3),log(c)) 
and —(log(ci7))“^ are of order d and as d goes to infinity respectively, T is of 
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d 

£ 

L 

7 

1 

•TS 

0 

o(C/iog(£-A) 

0{L-'^) 

P 

0{d^) 


~~o(77)~ 


Table 1 

For constant step sizes, dependency of 7 and p in d, e and parameters of U to get 
\\&xR^ — ttIItv < £ using Corollary 19 


order cP. Therefore, 7 defined by (17) is of order d~^ which implies a number 
of iteration p of order to get \\5xQ^ — ttHtv < e for e > 0; see also Table 1. 

Corollary 19 can be compared with the results which establishes the depen¬ 
dency on the dimension for two kinds of Metropolis-Hastings algorithms to sam¬ 
ple from a log-concave density: the random walk Metropolis algorithm (RWM) 
and the hit-and-run algorithm. It has been shown in [28, Theorem 2.1] that 
for e > 0, the hit-and-run and the RWM reach a ball centered at tt, of radius 
e for the total variation distance, in a number of iteration p of order as d 
goes to infinity. It should be stressed that [28, Theorem 2.1] does not assume 
any kind of smoothness about the density tt contrary to Theorem 17. However, 
this result assumes that the target distribution is near-isotropic, i.e. there exists 
C > 0 which does not depend on the dimension such that for all a; G K^, 

C~^\\x\\^ < [ {x,yfTr{dy)<C\\xf . 

Note that this condition implies that the variance of tt is upper bounded by Cd. 

To conclude our study on convex potential, we also mention [8] which studies 
the sampling of the uniform distribution over a convex subset K C using 
coupling techniques. Let C > 0. A convex set K C is C-well rounded if 
B(0, 1) C K C B(0,C'fi). [8] shows that a number of iteration of order cP as d 
goes to infinity is sufficient to sample uniformly over any C-well rounded convex 
set. Comparison with our result is difficult since we assume that tt is positive 
on continuously differentiable, while [8] studies the case of uniform distri¬ 
butions over a convex body. An adaptation of our result to non continuously 
differentiable potentials will appear in a forthcoming paper [14]. 

3.3. Strongly log-concave densities 

More precise bounds can be obtained in the case where U is assumed to be 
strongly convex outside some ball; this assumption has been considered by [15] 
for convergence in the Wasserstein distance; see also [6] . 

H3 (Ms). U is convex and there exist Mg > 0 and m > 0, such that for all 
x, 2/ G satisfying \\x — y\\ > Mg, 

{VU{x)-VU{y),x-y)>m\\x-y\\'^ . 

We will see in the sequel that under this assumption the convergence rate in 
(3) does not depend on the dimension d but only on the constants m and Mg. 
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Proposition 20. Assume LI and H3(Ms). Let 7 G (0,2mL ^). For all 7 G 
(0,7], V{x) = \\x — x*\\^ satisfies (6) with X = and c = 2{d + mMf). 

Proof. The proof is postponed to Section 4.12. □ 

Theorem 21. Assume LI and H3(Ms). Let {'jk)k>i be a nonincreasing se¬ 
quence with 7i < 7, 7 G Then, for all n > 0, p > 1, n < p, and 

X G (10) holds with 


log(/c) = -(m/2)log(2) 


+ (l+niax(l,M,))} +log(2) 


C(4Q”) < 6 + 2 (d/m + m 2) / +2F^/^{l,Ti^r.,c,-,i,\\x-x^f) 

A{j,x) < L'^ G{X,c,ji,\\x - x*\f) , 


where F,G,uj are defined by (7), (8), (29) respectively, and X,c are given in 
Proposition 20. 

Proof. The proof is postponed to Section 5.1. □ 

Note that the conditions of Proposition 3-(i) are fulfilled. For constant step 
sizes 7fc = 7 for all A: > 1, Lemma 4 and Lemma 5 can be applied. We give 
in Table 2 the dependency of the step size 7 > 0 and the minimum number 
of iterations p > 0, provided in Lemma 4, on the dimension d and the other 
constants related to U, to get \\SxQf — ttHtv < for a target precision e > 0. 
We can see that the dependency on the dimension is milder than for the convex 
case. The number of iteration requires to reach a target precision e is just of 
order O{d\og{d)). 

Consider the case where tt is the d-dimensional standard Gaussian distribu¬ 
tion. Then for all p G N, 7 G (0,1) and x G is the d-dimensional 

Gaussian distribution with mean (1 — 7)^2: and covariance matrix a^ld, with 
cr.y = (1 — (1 — 7)^(P+^^)(1 — 7/2)“^. Therefore using the Pinsker inequality, we 
get: 


ll4i?P-^|lTV<2KL(d,i?P|^) 


< d 


log (cr^) - 1 -t cr.^ ^ |l -t (1 - 7)2p ||a;f d 


Using the inequalities for all t G (0,1), (1 — t) ^ < 1 -b t(l — t) ^ and for all 
s G (0,1/2), — log(l — s) < s -b 2 s 2, we have: 


\\Sx^ - ttII^v < d {7V2 + (1 - - 7 / 2)(1 - (1 - 7 )^(^+'))-^} 

+ af\l-jr\\xf . 

This inequality implies that in order to have — ttHtv ^ £ for £ > 0, the 

step size 7 has to be of order d“^/2 and p of order d^/2iog(d). Therefore, the 
dependency on the dimension reported in Table 2 does not match this particular 
example. However it does not imply that this dependency can be improved. 
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d 

£• 

L 

m 

Ms 

7 

O(d-I) 

0{ey\og{e-A) 


0(m) 


P 

C>(dlog(d)) 

0{e-Aog^{e-A) 

0{L-‘) 


0(MS) 


Table 2 

For constant step sizes, dependency of 7 and p in d, e and parameters of U to get 
||< 5 a:Q 7 — ttIItv < £ Using Theorem 21 


3.4- Bounded perturbation of strongly log-concave densities 


We now consider the case where C/ is a bounded perturbation of a strongly 
convex potential. 

H4. The potential U may he expressed as U = Ui U 2 , where 

(a) Ui : -A K satisfies H3(0) (i.e. is strongly convex) and there exists 

Li > 0 such that for all x,y £ 

\\VU,{x)-VU,{y)\\<L,\\x-y\\ . 

(b) 1/2 : —>■ R is continuously differentiable and ||172|loo + ll^^ 2 ||oo < +00 . 

The probability measure tt is said to satisfy a log-Sobolev inequality with 
constant Cls > 0 if for all locally Lipschitz function h : R^^ —>■ R, we have 

EnW (h^) < 2Cls J ||Vhf d^. 


Then [9, Theorem 2.7] shows that for all t > 0 and any probability measure 
/iQ ^ tt satisfying d/io/d 7 rlog(d/xo/d 7 r) S L^( 7 r), we have 


ll^o-Pt - ttIItv < e 


1 2 Ent^ 




(32) 


Under H4, [4, Corollary 5.7.2] and the Holley-Stroock perturbation principle 
[19, p. 1184], TT satisfies a log-Sobolev inequality with a constant which only 
depends on the strong convexity constant to of C/i and oscK<i(172). Define 


vu = 


2mLi 
m-\- Li 


(33) 


Denote by x\ the minimizer of Ui. 

Proposition 22. Assume H). Let {'fk)k>i be a nonincreasing sequence with 
7i ^ 2l{m -\- Li). Then for all p > 1 and x € R'^, 


\\y - x\\\^ Q^x,Ay) < ^[(1 


k^l 


ro7fe/2) l|x - x\\\^ 

+ 2w-\2d+{-ti+2vj-^)\\VU2tj . 


imsart-generic ver. 2014/10/16 file: main.tex date: December 20, 2016 











A. Durmus, E. Moulines/Non-asymptotic convergence analysis for the ULA 


18 


Proof. The proof is postponed to Section 4.13. □ 

Theorem 23. Assume LI and H4. Let be a nonincreasing sequence 

with 7 i < 2/(m + Li). Then, for all n,p > 1, n < p, and x G (10) holds 
with — log(K) = TOexp{—oscRd(t/ 2 )} and 

CHS^Q:;) < ||x - xlf + Li7„(7n + 2w-^) WVU^C + 2osc^4U2) 

+ 2Liw~^{l - W7„)(2d + (7i + 2tz7“^) HVC/all^) - d{l + log( 27 „m) - 2Li7„) 

Aij,x) < 2Ll {\\xl - + 2t^-i(2d+ (71 + 2w-^) IIVC/ 2 IIL)} + 2 , 

where w is defined in (33). 

Proof. The proof is postponed to Section 4.14. □ 

Note that sup„>i{C'(5x(5")/(—log( 7 „))} < + 00 , therefore Proposition 3- 
(i) can be applied and linip^+oo \\S.yQP — ttHtv = 0 if limfc ^+00 7 ^ = 0 and 
limfe_>+oo Tfe = + 00 . 

4. Proofs 

4.I. Proof of Lemma 1 

By a straightforward induction, we get for all n > 0 and x G 

n 

Q^Vix) < X^-Vix) + c Y, 7 .. (34) 

Note that for all n > 1, we have since {'yk)k>i is nonincreasing and for all ^ > 0, 
A‘ = 1 + /o‘ log(A)ds, 

n n n 

^7,A^*+i-" n log(A)7i) 

i—1 2=1 

n i n n 

< (-A'>'i log(A))"^ ^ n log(A) 7 j) - ]^(1 + A^i log(A) 7 j) 

i=l [j=z+l j=i 

< (-A-^i log(A))-i . 

The proof is then completed using this inequality in (34). 
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4^.2. Proof of Lemma 6 

By LI, HI, the Cauchy-Schwarz inequality and VU{x*) = 0, for all x S 
||a^|| > Mp, we have 

U{x) — U{x*)= f {VU{x*t{x — x*)),x — X*) dt 

Jo 

Mp 

> f iyu{x*-\-t{x — x*)),x — x*)dt 

Jo 

+ y ^ {^(/{x*t{x — x^)),t{x — X*)) dt 

> -M^pL/2 + p\\x- x*r (« + 1)-' {1 - {Mp/ ||x - x*||r+i} . 

On the other hand using again LI, the Cauchy-Schwarz inequality and VC(x*) = 
0, for all x G B(x*,Mp), 

U{x) - U{x*) = [ {VU{x* + t{x - X*)), X - X*) dt > -M^L/2 , 

Jo 

which concludes the proof. 


4 . 3 . Proof of Proposition 7 

For all X G we have 

^^V,{x) = c(l - c) {- ||VC(x)f + (1 - 0“'AC/(x)} V,{x) . 

If a > 1, by the Cauchy-Schwarz inequality, under L 1-H 1 for all x G 
A[/(x) < dL and ||VC(x)|| > p||x —x*||“~^ for ||x —x*|| > Mp. Then, for all 

X ^ S^, 

£4^V^{x) < c(l - <i) |-/o llx - + (1 - 14 {x) < -c,dLVc;{x) , 

and sup{,,g£j < c;dLsup[y^£^y{V^{y)}. 

4.4- Proof of Proposition 8 

By HI, for all x ^ B(x*,Mp), 

||VC/(x)||>p||x-x*ir^ . (35) 
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Since under LI, for all x, y G J 7 (j/) < U{x) + (yU{x),y — x) + {L/ 2 )\\y — x\\^, 
we have for all 7 G (0,7) and x G 

R^V{x)/V{x) 

= (4717)""*/^ exp (^{U{y) - U{x)} /2 - (47)"^ ||y - a; + 7Vt/(a;)||^) dy 

< (47r7)-'^/2 f exp (- 4 "^ 7 ||V 17 (x)f - (47)"1(1 - 7L) ||y-x||^) dy 

< (l-7L)"‘^/2exp(-4"i7||V17(x)f) , 

where we used in the last line that 7 < L~^. Since log(l — Lj) = —L fj(l — 
Lt)~^dt, for all 7 G (0,7], log(l — Lj) > —Lj(l — Using this inequality, 

we get 

R^V{x)/V{x) < A"^ exp (- 4 "^ II VC/(x)f) . ( 36 ) 

By ( 35 ), for all x G ||x — x*|| > K, we have 

R-yVix) < X^Vix) . ( 37 ) 

Also by ( 36 ) and since for alH > 0 , e* — 1 < te‘, we get for all x G 

R^V{x) - \^V{x) < A^(A- 2 ^ - l)U(x) < -27log(A)A-^U(x) . 

The proof is completed combining the last inequality and ( 37 ). 


4 . 5 . Proof of Theorem 9 


We first bound ^(7, x) for all x G Let x G By LI, we haveEa:[|| V17(Xfc)||^] 
L^E2,[||Xfc — x*|p]. Consider now the function (pa : M+ —)• M+ defined for 
all t > 0 by (pa{t) = exp(Aa(t + i3a)“/^) where A„ = pl{2{a + 1)) and 
Ba = {{2 —a)/{aAaN^°‘■ Since <pa is convex and invertible on R+, we get 
using the Jensen inequality and Lemma 6 for all fc > 0: 

M\\Xk - x*f ] < C' (ll^fe - )]) < C' (e““/"+®“''E,[U(Xfc)]) , 


where U(x) = exp({7(x)/2). Using that for all t > 0, (pa^{t) < (A^,^ log(f))^/“ 
and Lemma 1, we get 


supEa;[||Xfc - x*|p] < aa/2 + 5“/^ + log{G(A,c( 7 i),y(x))} ) 

k>0 ^ L ^ 


1 N 2/a 


Eq. (19) follows from Proposition 7, Proposition 8 and Lemma 1. 
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4^.6. Proof of Theorem 10 


Lemma 24. Let ft and v be two probability measures on and V : 

-A [l,oo) be a measurable function. Then 

\\h - < V2{uiV^) + . 

Proof. Without losing any generality, we assume that ft v. For all t € [0,1], 
tlog(t) — t + 1 = j^{u — t)u~^du > 2“^(1 — t)^, and on [l,+oo), t >->• 2(1 + 
t){t\og{t) — t + 1) — (1 — is nonincreasing. Therefore, for all t > 0, 

|1 - t| < (2(1 + t)(tlog(t) - t + 1))^^^ . (38) 

Then, we have: 


\\h-^\\v= sup 

/GF(R'i).||/||v'<l 


f f{x)dfj,{x) - f f{x)diy{x) 
Js.'i Js.'i 


sup 

/eF(R<i),||/||v<l 


/R'i 


fix) 


_ 

\d:/ 


1 > di^{x) 


< 


V{a 


(k _ 1 

dz/ 


dr{x) . 


Using (38) and the Cauchy-Schwarz inequality in the previous inequality con¬ 
cludes the proof. □ 


Proof of Theorem 10. First note that by the triangle inequality and Proposi¬ 
tion 7, for all p > 1 

Ik - kO^lki /2 < Cy^Kr<V^/\x) + IlkPr, - kQ^lki /2 . (39) 

We now bound the second term of the right hand side. Let k = di 

and r.^ be respectively the quotient and the remainder of the Euclidean division 
of p by k. The triangle inequality implies UkUpp — SxQjWvi/^ < A A- B with 


A = 




yi/2 


q-y 






yi/2 


It follows from Proposition 7 and > 7 ^ that 


2=1 


S Pr —SO 


\/l/2 


(40) 


We now bound each term of the sum in the right hand side. For all initial 
distribution i^q on (R^^, i3(]R‘^)) and i,j > 1, i < j, it follows from Lemma 24, 
[22, Theorem 4.1, Chapter 2] and (15): 

IkoQk 117/2 < 2 7og7(^) + i^on.,,(r))KL(z2ogk>oPr.j 

<2L2 {,yoQl;^iV) + tyoPr^jV)) 

X a-i) (l^d +sup voQl;^~W\^Uf) 

\ ke{i,--,j} 
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Proposition 7 implies by the proof of [31, Theorem 6.1] that for all y and 
t > 0: PtV{y) < V{y) + fixjijQxii- Then, using Proposition 8, Lemma 1 and 
k.y > 7“^ in (40), we get 


sup 

.q^} 




-l)k-y+l,ik.-y 


^xQ. 


ik-Y 


2 

yl/2 


<2 ^(1 + 7)L^ {2G(A, c,V{x)) + /3i/2/0i/2} 

X {7d + 3-i7"ll V17 ||^v2 G(A, c, ^(a;))} 


Finally, A can be bounded along the same lines. 


□ 


4-7. Proof of Theorem 12 

Denote for 7 > 0, r-,, : x —?> the transition density of Rry defined for 

x,y by 

rj{x,y) = (4jt7)“^exp(-(47)“^ ||j/- a; + 7V17(a;)|l^) . (41) 

For all n > 1, denote by g” : R'^ x R"^ —>■ R'^ the transition density associated 
with Q” defined by induction by: for all x, j/ G R'^ 

<l\{x,y) =rj^ix,y) , g"+^(x,y)=/ ql^i^x, z)r^„^-,{z,y)dz ior n > 1 . 

(42) 

Lemma 25. Assume LI. Let {'jk)k>i be a nonincreasing sequence with 71 < L. 
Then for all n > 1 and x,y G R'^, 

exp (2-^{U{x) - U{y)) - (2(t^,„)-i ||y- x||^j 

d^iXiU) ^ 7Z p-rn Y \\d/2 ’ 

(2Tza^^n[\i=iA- 

where 27i(l - ^70“^- 

Proof. Under LI, we have for all x,y G R^^, U{y) < U{x) + {'VU{x),y — x) + 
(L/2) lly — x||^, which implies that for all 7 G (0,L“^) 

r'y(x,y) < (4it7)“‘^/2exp (^2-^(U(x) - U{y)) - (1 - L7)(47)“i j]?/ - xf) . 

(43) 

Then, the proof of the claimed inequality is by induction. By (43), the inequality 
holds for n = 1. Now assume that it holds for n > 1. By induction hypothesis 
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and (43) applied for 7 = 7 n+ij we have 

|27tcr^,„P(l - L 7 ,)I exp(2-^(U(x)-U(y))) 

X [ exp ||z - a;|l^ - (1 - L7„+i)(47„+i)“^ ||z - ) dz 

r n 'I —dji 

< (4j17„+i)-‘^/2 J 2 itcr^ ^P(1 - L-ii) I + (1 - i7n+l)/(27n+l))"‘'^^ 

X (2it)'^/^exp (2“^(?7(x) - U{y)) - (2(t^,„+i)"^ ||y- a;||^) . 

Rearranging terms in the last inequality concludes the proof. □ 

Lemma 26. Assume LI and HI. Then < du where 


0 « ..(2.)l"')/^(d-l)! 

= ‘ ,-r((<i + i)/ 2 ) 


(44) 


and Ua is given in (18). 

Proof. By Lemma 6, for all x G U{x) > p\\x — x*|| /{a + 1) — Oq,. Using the 
spherical coordinates, we get 

J e-^(^)dy < e““ {(2jr)('^+i)/Vr((d + l)/2)} . 


Then the proof is concluded by a straightforward calculation. 


□ 


Corollary 27. Assume LI and HI. Let {'^k)k>i be a nonincreasing sequence 
with 7 i < L. Then for all n > 1 and x G R'*, 


Var, 


d(5a;Q" 

dTT 


-d/2 


< (du exp(U(x))) I 4k Et 




^ 1 


where Ajj is given by (44). 

Proof of Theorem 12. We bound the two terms of the right hand side of (10). 
The first term is dealt with the same reasoning as for the proof of Theorem 9. 
Regarding the second term, by [2, Theorem 1.4], tt satisfies a Poincare inequal¬ 
ity with constant log~^(K). Then, the claimed bound follows from (22) and 
Corollary 27. □ 


4 . 8 . Proof of Proposition 13 


Set X = 7/4 and for all x G R'^, (t>{x) = (||a: — a:*||^-|-l)^/^ . Since (j) is 1-Lipschitz, 
we have by the log-Sobolev inequality [7, Theorem 5.5] for all x G R'^, 


R.yWc{x) < \\x-'fVU(x)-x*\\‘^+2-id+i+x^i 


(45) 
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Under LI since [/ is convex and x* is a minimizer of U, [33, Theorem 2.1.5 
Equation (2.1.7)] shows that for all x € 

(V[/(x), X - X*) > (2L)-i II VC/(x)f + v\\x- x*|| , 

which implies that for all x G and 7 G (O, , we have 

||x - 7 Vt/(x) - x*f < ||x - x*f - 277 ||x - x*|| l{\\x-x*\\>M^} ■ (46) 

Using this inequality and for all u G [0,1], (1 — m)^/^ — 1 < —n/2, we have for 
all X G R'^, satisfying ||x — x*|| > Rc = max(l, 2dr]~^,Mrj), 

^||x — 7 VC/(x) — x*||^ + 27 ci+1^ — 4>{.x) 

< (j){x) I (1 - 2j(j)-^{x){T] ||x - x*|| - - l| 

< —^(j)~^{x){r] ||x — x*|| — d) < —( 777 / 2 ) ||x — x*|| (j)~^{x) < — 2“^/^777 . 
Combining this inequality and (45), we get for all x G R'^, ||x — x*|| > Rc, 

R-,Wc{x)/Wc{x) < = A'^ . 

By (46) and the inequality for all a,b > 0, Vo + 1 + 5 — < a 12, we get 

for all X G R'^, 

\j ||x — 7V17 (x) — x*||^ + 2'fd + 1 — (j){x) < 'yd . 

Then using this inequality in (45), we have for all x G R"^, 

R^Wcix) < AWc(x) + (e^'^('^+x) _ A^) ■ 

Using the inequality for alH > 0, e* — 1 < te* concludes the proof. 

4 . 9 . Proof of Corollary If 

We preface the proof by a Lemma. 

Lemma 28. Assume LI and that U is convex. Let ( 7 fe)fegN* be a nonincreasing 
sequence with 71 < L~^. For all n > 0 and x G R^, 

[ Q"(a^,d?/) < { 477-1 [1 +log {G(A,c, 71 , Wc(x))}]}^ , 

jR-i 

where lUc,A, c are given in (25) and Proposition 13 respectively. 
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Proof. Let n > 0 and x £ Consider the function : K —>■ K defined by 
for all t G R, = exp {(7y/4)(t + (4/??)^)^/^}. Since this function is convex 
on R+, we have by the Jensen inequality and the inequality for all t > 0, 
(fit) < ei+(’'/4)(*+i)'^"^ 

4> (^J^ \\y - x*f Q'f;{x,dy)'^ < e^Qf;Wc{x) . 

The proof is then completed using Proposition 13, Lemma 1 and that (f> is one- 
to-one with for all < > 1, < (4?7“^ log(t))^. □ 

Proof of Corollary I 4 . Using VU{x*) = 0, LI and Lemma 28, we have for all 

fc > 0, 

f ||VU(y)f g^(x,d2/) <Lq4ry-qi + log{G(A,c,7i,lUc(x))}})' . 

jRd' 

□ 


4 . 10 . Proof of Theorem 15 


We preface the proof by a Lemma. 

Lemma 29. Assume LI and that U is convex. Then 


/R'i 


< (qappapli + \ , 47 ) 

+ l)/2) V{dl2 + l)j ’ 


Proof. By (24) and U{x*) = 0, we have 


[ e-^^y^dy< [ e-''ll^-"*lldy+ / l{||,_,*||<M,}cly . 
jR‘i jR‘i Jr<‘ 

Then the proof is concluded using the spherical coordinates. 


□ 


Proof of Theorem 15. By [5, Theorem 1.2], tt satisfies a Poincare inequality with 
constant log“^(K). Therefore, the second term in (10) is dealt as in the proof of 
Theorem 12 using (22), Lemma 29 and Lemma 26. □ 


4 . 11 . Proof of Proposition 16 

For all a: G we have 

s^^Wc{x) =- f) - I_ x*\\^ + 1)“^/^ ||a: - a:*||^ 

4(||x-x*f+ 1)1/2 

- {VU{x),x - X*) - (||a: - + l)“i ||a: - + d| . 

By (24), {VU{x), x — x*) > r? ||a; — j;*|| for all x G ||a: — a;*|| > M,,. Then, 
for all X, llx —a:*|| > K = max(M,,,4d/ry, 1), £/^Wc(x) < —(rf^/8)Wc(x). In 
addition, since U is convex and VU{x*) = 0, for all x G R'^, {\7U{x), x — x*) > 0 
and we get sup^j.^^^ £/^Wc{x) < ft. 
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4-12. Proof of Proposition 20 

Under LI, using that VU{x*) = 0, we get for all x 


[ \\y - x*f R.y{x,dy) = \\x - X*-\--fiVU{x*)-VU{x))\f -\-2-fd 
Js.<‘ 

< (1 + (L7)2) ||a; - x*f - 2-/ {VU{x) - VU{x*), x - x*) + 2jd . (48) 

Then for all x € K.'^, ||a: — a:*|| > Mg, we get using for alH > 0, 1 — t < e“* 

[ \\y - ^ \\x-x*\\^ + 2 'yd. 

Using again (48) and the convexity of U, it yields for all x G ||a; — a::*|| < Mg, 

f \\y- Rj{x,dy) <-fC , 

Jr<‘ 

which concludes the proof. 


4-13. Proof of Proposition 22 

We preface the proof by a lemma. 

Lemma 30. Assume H4. Then, for all x G 

\\x--/yU{x)-xlf < (1 -^ 7 / 2 ) \\x-xl\\^ + "iff+ 2w~^) ||VC72|1^ . 

Proof. Using that for all y,z G \\y + z||^ < (I+CC 77 / 2 ) ||y||^ + (l+ 2 (n 77 )“^) ||z||^ 
we get under H4-(b): 

\\x — ^VU{x) — x*i\\^ < (1 + ^^ 7 / 2 ) ||a; — 'yVUi{x) — x^W'^ 

+ 7(7 + 21^-') IIVU 2 IIL ■ (49) 

By [33, Theorem 2.1.12, Theorem 2.1.9], H4-(b) implies that for all x,y G 

{VUi{y)-WUi{x),y-x) > (,,7/2) ||y - xf + ^—||VC/i(y) - Vt7i(x)f , 

Using this inequality and VUi{x*) = 0 in (49) concludes the proof. □ 

Proof of Proposition 22. For any 7 G (0, 2/(to + Li)), we have for all x G 

f \\y - xlf R.yix,dy) = \\x--fVU{x) - xl\f -\-2-fd 

< (1 - W 7 / 2 ) jja; - a;*f + 7 {(t + SW^) llVC/sjj^ + 2 d| , 
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where we have used Lemma 30 for the last inequality. Since 71 < 2/(m + Li) and 
{lk)k>i is nonincreasing, by a straightforward induction, for p > 1 and x G 


/ < Wil - w^k/r)\\x - x\\\^ 


P P 

+ ((7i+2^-i)||VC/2||L + 2d)E n (1 - tz77fc/2)7i . 

i—n k—i-\-l 


(50) 


Consider the second term in the right hand side of (50). Since 71 < 2/(m + Li), 
m < Li and ( 7 fe)fc>i is nonincreasing, maxfc>i jk < and therefore: 


p p 

E n (1 - vjjkl‘2.)l^ 

i—n k—i-\-l 


^Ei n (1 -^^7fc/2)- n (1 -n77fc/2) 


i—n \,k=i-\-l 


k—i 


< 2w ^ . 


□ 


4-14- Proof of Theorem 23 

We preface the proof of the Theorem by a preliminary lemma. 

Lemma 31. Assume H4. Let 7 £ (0, 2/ (to + Li)), then for all x £ K.'^, 

EnW ^ (Ei/2) {(1 - W 7 / 2 ) Ik - + 7(7 + 2tx7-1) ||V172||L} 

+ oscRd( 172 ) - {d/ 2 ){l + log(27TO) - 2L17) . 

Proof. Let 7 £ (0, 2/(to + Li)) and r^ be the transition density of R.y given 
by (41). Under H4-(a) by [33, Theorems 2.1.8-2.1.9], we have for all x £ 
Ui{x) < Ui{x4) + (Li/2) Ik — x\\\^ Therefore we have for all x 

EnU = J fog{r.y{x,y)/Tr{y))r.y{x,y)dx 

< R-^'ipix) - (d/2)(l log( 47 t 7 )) , (51) 

where ?/; : —>■ R is the function dehned for all y £ R'^ by 

k(j/) = U 2 {jj) + Ui[x\) + (Li/2) ||y - xlf + log (^J ^ . 
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By H4-(b) and Lemma 30, we get for all x G 

R-y^{x) < (Ll/2) \\x - 'yVUix) - a;/f + log f / 

yjR'i 

+ 0SC]uii(C/2) + dL\^ 

< (Li/ 2 ) |(1 - w-i/ 2 ) llx - x*if + 7(7 + 2w"^) II VUallL} 

+ osC]|jd(172) “t“ dL\^ . 

Plugging this bound in (51) gives the desired result. □ 

Proof of Theorem 23. We first deal with the second term in the right hand side 
of (10). Under H4, [4, Corollary 5.7.2] and the Holley-Stroock perturbation 
principle [19, p. 1184] show that tt satisfies a log-Sobolev inequality with constant 
Cls = — log~^(K). So by (32) we have 

||4Q!;Lt-^||TV<«‘|2EnW ^ . 

We now bound Ent,r {dSxQ'f /d'K') which will imply the upper bound oiC{5xQ^). 
We proceed by induction. Eor n = 1, it is Lemma 31. For n > 2, by (42) and 
the Jensen inequality applied to the convex function t 1 —>■ tlog(t), we have for 
all X G and n > 1, 

Ent, (dJ^QZ/d^) 

= [ log|7r-i(y) [ q”-^(x,z)r^„(z,y)dzl [ q:^-^{x, z)rj„{z,y)dzdy 
jR’i I JR<i J JR-i 

< [ [ \og{r^^{z,y)TT-^{y)] qf^-^{x,z)r^^{z,y)dzdy . (52) 

JR'^ 

Using Fubini’s theorem. Lemma 31, Proposition 22, and the inequality t > 0, 
1 — t < e“* in (52) implies the bound of C(Ja;(5"). 

Finally, 7l(7, x) is bounded using the inequality for all y, z G R'^, ||y + z||^ < 
2(l|y||^ + Ikll^): H4 and Proposition 22. □ 


5. Quantitative convergence bounds in total variation for diffusions 

In this part, we derived quantitative convergence results in total variation norm 
for d-dimensional SDEs of the form 

dXt =6(Xt)dt + dL/, (53) 

started at Xo, where {Bf)t>o is a d-dimensional standard Brownian motion and 
5 : R^^ —>■ R'^ satisfies the following assumptions. 

Gl. b is Lipsehitz and for all x, y G R^^, (5(x) — b{y), x — y) < 0. 
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Under Gl, [20, Theorems 2.4-3.1-6.1, Chapter IV] imply that there exists a 
unique solution (Xt)(>o to (53) for all initial point x G which is strongly 
Markovian. Denote by (Pt)t>o the transition semigroup associated with (53). 
To derive explicit bound for ||Pt(a;, •) — Pt(2/, •)||tv, we use the coupling by 
reflection, introduced in [27] to show convergence in total variation norm for 
solution of SDE, and recently used by [15] to obtain exponential convergence in 
the Wasserstein distance of order 1. This coupling is defined as (see [10, Example 
3.7]) the unique strong Markovian process (X^, Yt)j>o on solving the SDE; 


dXj = b{'X.t)dt + dBf 

dYi = b{Yt)dt + (Id -2eteJ)dBf , 


where e* = e(Xt - Yt) 


(54) 


with e{z) = zj jjzjj for z and e(0) = 0 otherwise. Define the coupling time 


T, = inf{s > 0 1 X, ^ YJ . (55) 

By construction Xj = Yt for t > Tc- We denote in the sequel by P{x,y) and 
]E( 3 , y) the probability and the expectation associated with the SDE (54) started 
at {x,y) G R^'^ on the canonical space of continuous function from R+ to R^'^. 
We denote by {J-t)t>o the canonical filtration. Since Bf = (Id —2esef)di3f is 
a d-dimensional Brownian motion, the marginal processes (Xt)t>o and (Yt)t>o 
are under ^{x,y) weak solutions to (53) started at x and y respectively. The 
results in [27] are derived under less stringent conditions than Gl, but do not 
provide quantitative estimates. 

Proposition 32 ([27, Example 5]). Assume Gl and let (X(,Y()t>o be the 
solution of (54). Then for all t >0 and x,y G R'^, we have 

P(.,y) (tc >t)= P(,,,) (Xt ^ YO < 2 I (2ti/2) jjx - 2/111 - 1/2^ . 

Proof. For t < Tc, Xt — Yj is the solution of the SDE 

d{Xt - YJ = {6(Xt) - b{Yt)} dt + 2etdBl , 


where B} = f* . Using the Ito’s formula and Gl, we have for all 

t < Tc, 


lX.-Ydl = 


II = ||x -yll+ f (b(X,) - b{Ys),es) ds + 2Bl < |]a; - y\\ 
Jo 


2Bl 


Therefore, for all x, y G R'^ and t > 0, we get 


P(x.y) {Tc >t) < P(3,^y) 


( min BI > ||x 

\o<s<t “ " 



(x,v) 


{ max B\ < ||x 

\0<s<t “ 



P(,,,)(|Bi|<||x-y|]/2) 


where we have used the reflection principle in the last identity. 


□ 
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Define for i? > 0 the set A/j = {x,y £ \ ||a: — y\\ < R}. Proposition 32 

and Lindvall’s inequality give that, for all e G (0,1) and t > uj{e, R), 

sup ||Pt(a;,-) - OIItv < 2(1 - e) , (56) 

{x,y)eAR 

where oj is dehned in (29). To obtain quantitative exponential bounds in total 
variation for any x,y £ it is required to control some exponential moments 
of the successive return times to A/j. This is first achieved by using a drift 
condition for the generator £/ associated with the SDE (53) defined for all 
/ G by 

■e^/ = (&,V/) + (l/2)A/. 

Consider the following assumption: 

G2. (i) There exist a twice continuously differentiable function V : i—> 

[l,oo) and constants 9 > 0, P > 0 such that 

< -9Y + P . (57) 

(ii) There exists <5 > 0 and R > 0 such that 0 C Ar where 

0 = {(x, y) G R^'^ I V{x) + V{y) < 20-^ + <5} . (58) 

For t > 0, and G a closed subset of R^'^, define by T^’* the Hrst return time 
to G delayed by t: 

Tf’‘ = inf{s>t I (X«,Y,)gG} . 


For j > 2, define recursively the j-th return time to G delayed by t by 


nG,i 


= inf{s > Tfl\ + t I (X„ Y,) G G} = T'fl\ + T^^’' o S. 


G,t 

3- 


pG,t 


(59) 


where S is the shift operator on the canonical space. By [16, Proposition 1.5 
Chapter 2], the sequence (T^’*)j>i is a sequence of stopping time with respect 
to t)t>o- 

Proposition 33. Assume G1 and G2. For all x,y £ R'^, e G (0,1) and j > 1, 
we have 


E(a;,y) 





< {K(e)p-i{(l/2)(P(cr) + P(y))+e^‘-(^’^)0-i/3} , 


9 = 9‘^5{2P + 95)-^ , K(e) = d”V (l + + 5/2 , (60) 

where uj is defined in (29) . 

Proof. For notational simplicity, set Tj = Note that for all x,y £ R'^, 


(x) + s^V(y) < -9{V(x) + V{y))+ 2pie{x, y) . 
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Then by the Dynkin formula (see e.g. [32, Eq. (8)]) the process 


t ^ (l/2)e«>i^‘) {E (XT,At) + 1/ (Ytia*)} , t > a;(e, R) 


is a positive supermartingale. Using the optional stopping theorem and the 


Markov property, we have, using that for all t > 0 ^{x,y) e®‘U(Xt) <V{x) + 


^ix,y) < (l/2)(U(cc) + V{y)) + . 


The result then follows from this inequality and the strong Markov property. □ 
Theorem 34. Assume G1 and G2. Then for all e G (0, 1), t > 0 and x, y G R.'^, 

||Pt(x,-) - Pt(y,-)I|TV < 2e-®‘/2 {(l/2)(U(x) + U(y)) + + 4 ac‘ , 

where uj is defined in (29), 0,K(e) in (60) and 

log(K) = (0/2) log(l - e){log(K(e)) - log(l - e)}"^ . 

Proof. Let x, y G and t > 0. For all .£ > 1 and e G (0,1), 

P(a:,y) (Tc > t) < P(3;,y) (Tc > t, T^ < <) + P(x.y) (T^ > t) , (61) 

where Te = bound the two terms in the right hand side of 

this equation. For the first term, since 0 C Ar, by (56), we have conditioning 
successively on for J = > 1) and using the strong Markov property. 


^{x,y) (Tc >t,Tt<t) <{1- eY . 


(62) 


For the second term, using Proposition 33 and the Markov inequality, we get 



{(l/2)(U(x) + U(y)) + 


+ e-^‘/2{K(e)}'-i . 


The proof is completed combining this inequality and (62) in (61) and taking 


£ = 2-i0t/(fog(K(e)) - log(l - e)) 


□ 


More precise bounds can be obtained under more stringent assumption on 
the drift 5; see [6] and [15]. 

G3. There exist Ms > 1 and rhs > 0, such that for all x,y G ||x — y|| > Ms, 


(6(x) - &(y), X - y) < -ms ||x - yf . 


Proposition 35. Assume Gl and G3. 
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(a) For all x,y and e G (0,1) 


E, 


{x,y) 


f 7 


exp ( ^ ( r, A 




< 1 + ||x - 2/11 + (1 + _ 


(h) For all x,y G e G (0,1) and j > 1 


E, 


(a;.y) 


exp ( (ms/2) A 

< {D(e)H-i {l + ||x - 2/11 + (1 + ^ 

D(e) = (1 + + iV4) , (63) 


where uj is given in (29). 
oof. In the proof, we set 

(a) Consider the sequence of increasing stopping time 

Tfe=inf{t>0| \\Xt-Yt\\^[k-\k]} , k>l, 


and set Ck = Tk A Ti. We derive a bound on E/^. j,)[exp{(?TT,s/2)^fc}] independent 
on k. Since limfc_>.+oo f Xk = Tc almost surely, the monotone convergence theo¬ 
rem implies that the same bound holds for [exp{(7fis/2)(TcATi)}]. Set now 
Ws(a7, y) = l + \\x — 2/11. Since Wg > 1 and Tc < oo a.s by Proposition 32, it suf- 
hces to give a bound on E/^, y)[exp{(?7is/2)^fc}Ws(X(^^, Y^^)]. By Ito’s formula, 
we have for all v,t <Tc, v <t 


emst/2w,(Xt,Yt) = e™=”/2Wg(X„,Y„) + (ms/2) [ e'^=“/2Ws(X„, Y„)du 

J V 

+ f - 6(Y„), eu) du + 2 f . (64) 

J V J V 

Using G3(6), we have for all fc > 1 and tg = io{e, Mg) < v < t 

r-CfcAt 


2 / . 


'CjsAk 


So the process 


{exp((?hg/2)(Cfc At)) Wg(X^^^At,Y(;^At)}t>t^ , 

is a positive supermartingale and by the optional stopping theorem, we get 


E(a;,y) 

e(™=/2)^'=Wg(Xa,Ycj' 
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where we used that Cfc A 4- By (64), G1 and G3, we have 


E 


{x,v) 


< W,(x, y) + {l + 


and (65) becomes 


E, 


{x,v) 


e(A=/2)aw,(Xc,,Ycjl < W,{x,y) + (1 + . 


(b) The proof is by induction. The case j = 1 has been established above. 
Now let j > 2. Since on the event {tc > Tj_i}, we have 


Tc A Tj = Tj_i + (tc a Ti) o Stj_i , 


where S is the shift operator, we have conditioning on using the strong 

Markov property. Proposition 32 and the first part. 


E, 


ix,v) 




< D(e)E(3,^j^) 




Then the proof follows since D(e) > 1. 


□ 


Theorem 36. Assume G1 and G3. Then for all e G (0,1), t > 0 and x,y G K.'^, 
||Pt(a;, •) - Pt(y, OUtv < 2 {(1 - e)"^ + 1 + ||a; - 2 /||} k* 

log(K) = (tos/ 2) log(l - e)(log(D(e)) - log(l - e))"^ , 
where D(e) is defined in (63). 

Proof. The proof is along the same lines as Theorem 34. Set Tj = T^ ^ 
for j > 1. Let x,y GMfi and t > 0. For all £ > 1 and e G (0,1), 

P(x,y) (Tc > t) < P(x,y) (Tc > t,Te < t) + P(x,y) (T^ A Tc > t) . (66) 


For the first term, by (56) we have conditioning successively on J-Tj , for j = 
£,■ ■ ■ ,1, and using the strong Markov property, 

P{x,y) {tc >t,Tt<t) <{l- ef . (67) 

For the second term, using Proposition 35-(b) and the Markov inequality, we 
get 


P{x,y){PiATc>t)<e ”^‘{D(e)}'^ ^|l + ||a: 


2/11 + (1 + 


Tns^(e, Afs) 
2 


( 68 ) 


Taking £ = [(7fisi/2)/(log(D(e)) — log(l — e))J and combining (67)-(68) in (66) 
concludes the proof. □ 
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5.1. Proof of Theorem 17 and Theorem 21 

Recall that (Pt)t>o is the Markov semigroup of the Langevin equation associated 
with U and let be the corresponding generator. Since {Pt)t>o is reversible 
with respect to tt, we deduce from Theorem 34 and Theorem 36 quantitative 
bounds for the exponential convergence of {Pt)t>o to tt in total variation noting 
that if (yt)t>o is a solution of ( 1 ), then (lt/ 2 )t>o is a weak solution of the 
rescaled Langevin diffusion: 

dft = -(l/2)V17(ft)dt + dBf . (69) 

Proof of Theorem 1 7. Since the generator associated with the SDE (69) is 
Proposition 16 shows that (57) holds for Wc with constants 9/2 and /3/2. Using 
that for all 01,02 G M, < (l/2)(e“i +e“=), G2-(ii) holds for 5 = 29~^j3 

and R = ( 8 / 77 ) log(40“^/3). By Theorem 34 with e = 1/2, we get for all x,y 
and t > 0 


ll-ft(a:,-) - Ptiy,-)\\TN < dw* 

+ 2e-®*/^{(l/2)(lUc(a:) + Wc(2/)) + 26»-i/3e^®”'“(2-M8/r;)iog(4e-i/3))| ^ 

where w is defined in (30a). By [32, Theorem 4.3-(ii)], (57) implies that Wc{y)n{dy) 
P9~^. The proof is then concluded using this bound, (70) and that tt is invariant 
for (Pt)t>o □ 

Proof of Theorem 21. By applying Theorem 36 with e = 1/2, the triangle in¬ 
equality and using that tt is invariant for {Pt)t>o, we have 

||Pt(a;, •) - ttIItv < 2 js -h ||a; - a;*|| -h \\y - x*\\ d7r(?/)| . 

It remains to show that \\y — x*\\dn{y) < {d/m A- For this, we 

establish a drift inequality for the generator of the Langevin SDE associated 
with U. Consider the function 144(a;) = Ik — For all a: G we have using 
VU{x*) = 0, 

£/^Wyx) < 2{d - {VU{x) - VU{x*), X - X*}) . 

Therefore by G3, for all x G |k — a;*|| > Mg, we get 
s/^Ws{x) < —2mWs{x) -\- 2d , 

and for all a: G 

£/^Ws{x) < —2m444(a;) -I- 2{d A- mM^) . 

By [32, Theorem 4.3-(ii)], we get 444(j/)d7r(j/) < d/m -A- The bound on 
C(SxQ") is a consequence of the Cauchy-Schwarz inequality. Proposition 20 and 
Lemma 1. The bound for A{j,x) similarly follows from LI, Proposition 20 and 
Lemma 1. 

□ 
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